Colibacillosis, caused by avian pathogenic Escherichia coli (APEC), is a significant disease, causing extensive animal and
financial losses globally. Because of the significance of this disease, more knowledge is needed regarding APEC's mechanisms of
virulence. Here, we present the fully closed genome sequence of a typical avian pathogenic E. coli strain belonging to the
serogroup O78.

Colibacillosis, caused by avian pathogenic Escherichia coli
(APEC), is one of the most significant infectious diseases
affecting poultry (1-7). Poultry colibacillosis takes many forms,
with systemic forms occurring most often (2). Collectively, these
diseases result in annual multimillion-dollar losses due to
mortality, decreased production, and condemnations (1, 3, 5).
Indeed, colibacillosis poses a profound threat to one of
humankind's cheapest sources of high-quality animal protein.
Despite the importance of this disease, the mechanisms of APEC
virulence remain largely unknown. Studies into APEC pathogenesis
would be enhanced by public access to high-quality genomic
sequences. To date, three APEC sequences are publicly available.
The sequence of APEC O1, an O1:K1:H7 strain isolated from the lung
of a turkey, is fully closed (9). A draft sequence of a Brazilian
APEC strain, SCI-07, a member of the O nontypeable:H31 serotype
obtained from gelatinous edema lesions from a laying hen, is in 68
contigs (10), and a sequence of an O78 strain (χ7122) was recently
released in 12 contigs (11). Here, we describe a fully closed and
annotated sequence of an O78 strain, with the idea that fully
closed sequences representative of the most commonly isolated APEC
serogroups, such as O1 and O78, are needed to adequately support
future colibacillosis research (1).

APEC O78 is an O78 strain isolated from the lung of a turkey
clinically diagnosed with colibacillosis. Genomic sequencing was
performed using complementary sequencing technologies, combining
results obtained with a Roche/454 FLX genome sequencer (GS)
instrument and an Illumina HiSeq 2000. The following datasets
were used in the final assembly: (i) GS-FLX, with 590,773
shotgun reads totaling 237 Mb (~49-fold coverage); (ii) GS-FLX
8-kb mate-pair library with 474,583 shotgun reads totaling
168 Mb (~35-fold coverage), of which 330,857 were paired; and
(iii) Illumina 100-bp paired-end library with 27,389,600 reads
totaling 2,587 Mb (~539-fold coverage). Both 454 read sets were
assembled de novo using Newbler 2.6 (Roche), and Illumina reads
were assembled separately with Velvet 1.1 (8) and Illumina's
ELANDv2e assembler. The genome was closed using 454 assemblies
as a "reference" sequence and the Illumina data to add depth,
correct errors, and close gaps. Whole-genome optical mapping
(OpGen, Gaithersburg, MD) was used to validate scaffolds and
contig order. The assembly was confirmed using PCR and Sanger
sequencing and validated by consistency of paired-end evidence
from 454 and Illumina reads.
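
The fold-coverage figures quoted for the three datasets can be reproduced with a short calculation. The sketch below assumes that 1 Mb means 10^6 bases and that coverage was computed against the 4,798,435-bp chromosome alone; the announcement does not state the denominator, but this choice matches the reported values. The names `CHROMOSOME_BP`, `DATASETS_MB`, and `fold_coverage` are illustrative only.

```python
# Chromosome length as reported in this announcement (bp).
# Assumption: coverage is computed against the chromosome only.
CHROMOSOME_BP = 4_798_435

# Total sequenced bases per dataset in Mb (assumed 1 Mb = 10**6 bases).
DATASETS_MB = {
    "GS-FLX shotgun": 237,
    "GS-FLX 8-kb mate-pair": 168,
    "Illumina 100-bp paired-end": 2_587,
}


def fold_coverage(total_mb: float, genome_bp: int = CHROMOSOME_BP) -> float:
    """Mean fold coverage: total sequenced bases divided by genome length."""
    return total_mb * 1_000_000 / genome_bp


coverage = {name: fold_coverage(mb) for name, mb in DATASETS_MB.items()}

for name, cov in coverage.items():
    print(f"{name}: ~{cov:.0f}-fold")
```

Under these assumptions the three datasets round to approximately 49-, 35-, and 539-fold coverage, in agreement with the figures given in the text.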

Annotation was automated using the NCBI Prokaryotic Genomes
Automatic Annotation Pipeline (PGAAP). The final version was
checked against the previously completed Prokka 1.5.2
annotation.

The assembled genome consists of a single chromosome
(4,798,435 bp; 50.68% GC content) and two plasmids, one
217.830 kb and the other 113.260 kb. The chromosome contains
4,696 protein-encoding genes, 88 tRNA-carrying genes, and 19
rRNA-carrying operons. The chromosome of APEC O78 is
smaller than many other fully sequenced extraintestinal
pathogenic E. coli (ExPEC) genomes, and its chromosomal structure
appears different from those of other ExPEC genomes. Assessment
of the implications of these differences is ongoing, but the
addition of a fully closed genomic sequence of one of the
commonly occurring serogroups among APEC significantly
contributes to the toolset that can be used in studies of APEC
pathogenesis and colibacillosis control.
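
For readers who want to recompute summary statistics such as GC content or total genome size from the deposited sequences, a minimal sketch follows. It is not the authors' method: `gc_content` is a hypothetical helper that counts G and C among unambiguous bases, the toy sequence is invented for illustration, and the plasmid sizes are read as 217,830 bp and 113,260 bp, which is an assumption about how the kilobase figures above should be interpreted.

```python
def gc_content(seq: str) -> float:
    """Percent G+C among unambiguous bases (A, C, G, T) in a sequence."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    acgt = gc + seq.count("A") + seq.count("T")
    if acgt == 0:
        raise ValueError("sequence contains no unambiguous bases")
    return 100.0 * gc / acgt


# Replicon sizes in bp. The chromosome length is as reported; the plasmid
# values are an assumed reading of "217.830 kb" and "113.260 kb".
REPLICONS_BP = {
    "chromosome": 4_798_435,
    "plasmid 1": 217_830,
    "plasmid 2": 113_260,
}

total_bp = sum(REPLICONS_BP.values())

# Toy example only; a real check would read the chromosome sequence
# from the GenBank record.
toy_gc = gc_content("ATGCGC")

print(f"Total genome size: {total_bp:,} bp")
print(f"Toy sequence GC content: {toy_gc:.2f}%")
```

Applied to the full chromosome sequence, the same function would be expected to return a value close to the 50.68% reported above, although the exact figure depends on how ambiguous bases are handled.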

Nucleotide sequence accession number. Complete sequences
of APEC O78 have been deposited in GenBank under accession
number CP004009.